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Abstract. We consider the quantum expectation value yi = {ip\A\ip) of an observable A over 
the state \ip). We derive the exact probability distribution of A. seen as a random variable when 
varies over the set of all pure states equipped with the Haar- induced measure. The probability 
density is obtained with elementary means by computing its characteristic function, both for non- 
degenerate and degenerate observables. To illustrate our results we compare the exact predictions 
for few concrete examples with the concentration bounds obtained using Levy's lemma. Finally 
we comment on the relevance of the central limit theorem. 



1. Introduction 

The role of probability distributions in quantum theory cannot be overestimated. Arguably the 
most important of those distributions is the one describing the statistics of possible outcomes of 
the measurement of the observable associated with the self-adjoint operator A while the system is 
in the pure state \tp): Pa (a) := {ip\S (A — a) |?/'). This function is supported on the numerical range 
of A and, for bounded A can be equivalently characterized by the set of its moments (see e.g. [T]): 
rrik '■= {il'\A''\tp) [k S N), i.e., the expectation values of the family of observables {A'^jfegN in the 
state This latter, quite often, can be itself regarded as a random variable distributed according 
to some prior density that depends on the problem under consideration. For example, in the 
context of equilibration dynamics of closed quantum systems |51 [3] one is interested in the quantity 
a{t) := {ifi{t)\A\ilj{t)), where \ip{t)) :— e~**^|V') and H is the Hamiltonian operator of the system. 
If one monitors A by sampling time instants uniformly over a the interval [0, T] the underlying 
probability space for the (t))'s is the segment [0, T] equipped with the uniform measure dt/T. In 
this case averaging over the quantum states amounts to perform the time average 1/T a{t)dt. 

Another possibility that recently gained relevance in the foundation of statistical mechanics |1] 
is to consider and let vary over the full unit sphere of the, say d-dimensional, Hilbert 

space space. This manifold is transitively acted upon by the group of all c? x d unitary matrices 
U (d) and therefore inherits a natural invariant measure from the unique group-theoretic invariant 
measure over U (d) i.e., the Haar measure. 

In this paper we will address precisely this latter setting and compute the probability 
distribution for the quantum expectations {ip\A'^ lip) seen as random variables over the unit sphere 
of the Hilbert space equipped with the measure induced by the Haar measure. In this way the 
function Pa itself becomes a sort probability-density valued random variable that can be partially 
characterized by the probability densities of its moments toj, over the unit sphere. We will show 
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that these probabiHty densities can be determined with elementary tools and explicit analytical 
expressions for their characteristic functions can be obtained. 

After the completion of this work we became aware that similar results are contained in the 
recent works |5][SJ[23]. See also [7] for an entry into the mathematical literature. 



2. Preliminaries 



Our key object is A{4>) = Throughout this paper we write f {tp) = J Dip f (tp) for Haar- 

induced averages over pure states. Computing the first few moments of A (V') is a relatively easy 
task. The first moment reads 

m,=A= f DxbtTiA\i;){i;\) = ^^. (1) 



a result which follows from \tp){ip\ = 1/d A closed formula for the general moment can be 
obtained by noting that 
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n — 1 \ ni ^ — ' 



n 



Here is the operator that enacts the permutation tt in 'H'^" and Sn is the symmetric group of 
n elements. For a proof of ^ see e.g. |^. The proportionality constant is obtained noting that 

^ ^ ^ is the dimension of the totally symmetric space, and the remaining operator is an 



orthogonal projector. Using eq. ^ one obtains the following closed expression for the rt-th moment 
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m — 



(d + n-l) 



7reS„ 



^ ' 7re5„ ai = l (T„ = l 

This expression is a sum of contractions and represents a sum of products of traces of A. For 
instance for n = 2, 3 one has 

_ tr {Af + tr {A^) _ tr {Af + 3 tr (A^) tr (A) + 2 tr {A^) 

™' ~ did+1) ' " d3 + 3^2 + 2d ■ ^"^^ 

In any case it seems difficult to obtain the probability density with this approach. Instead our 
procedure will be that of computing directly the characteristic function x i^) e'-^-^f'^) and obtain 
the probability density by Fourier transforming. 

Choosing a basis (j — 1, . . . ,d) and calling Zj ~ we observe that we can write the 

average over \ip) as 



fW^cj5^\z,\'~l^f{i/j)d'z 



(5) 
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where we defined 

dxidyi 



poo pC 



= 1 ' 

The normahzation constant C can be computed with the same technique that we are going to show 
and it turns out to be equal to (d — 1)!. Using the Fourier representation for the delta function in 
eq. ([5]) we obtain a Gaussian integral that can be computed. For simplicity we use the basis that 
diagonalizes A: A — 0Lj\i){3\- Calling Da = diagjoi, 02, ... , a^} we can write the characteristic 
function as 

J J -00 27r 

As customary we introduced a small positive e in order to make the Gaussian integral absolutely 
convergent. The Gaussian integration gives 

{d-\)\ r°° dr e-'"- 

At this point we make the important assumption that all eigenvalues of A are non-degenerate. We 
will treat the general case in section |3] Under these conditions the integrand in eq. Q has only 
simple poles and the integral is easily evaluated with residues closing the circle in the in the lower 
half-plane. The result is, after sending e — > 0, 

Although it might not be readily apparent from eq. (l7|, x (A) is actually regular in A = 0. This fact 
follows from a set of identities proven in [Appendix A stating that 



Applying eq. ^ to eq. Q we thus see that x (A) is regular at A = and being a linear combination 
of analytic functions it is in fact analytic in the whole complex plane (entire). A simple way to 
remember that x (A) must be regular at A = is to note that x (A) = 1 + mi (iX) + O (A^) . In fact 
this very same approach can be used to prove eq. (|8|. Using eq. (|8| and eq. ([t]) we readily obtain 
the Taylor series of x (A) 

x(A) = ^™„i^ (9) 



ri=0 



1 (10) 



n 



Equation (|10| is a quite compact expression in place of the complicated eq. ([3|. Equating eq. (10 1 
with eq. (|3| we obtain the following non-trivial set of matrix identities valid when the spectrum of 
A is non-degenerate 



^ 4-.. \®n\ _ \^k) 



tr (P^A«^") = 



7reS„ fc= 



1 rij/fc (a/c - flj) 
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Since x (-^) is entire, analytic continuation is trivial and the moment generating function 
M {y) :— ey-^ can be simply obtained by setting iX ^ y in eq. ([t]). 

To obtain the probability density we must compute the Fourier transform of x 

^e-^^\{X). (11) 

We make use of the following trick. Since x W is well behaved in zero and decays at infinity, it 
belongs to the space of tempered distribution S' (M). The Fourier transform is well defined for such 
functions and the result is again a tempered distribution. Actually for d > 3, x is also summable 



and so its Fourier transform can be safely defined by the absolutely converging integral eq. (111. 
We proceed than forgetting about the behavior in A = and compute the Fourier transform as a 
linear combination of Fourier transform of the functions e*^'^'=/ {iX)'^ ^. Such functions however are 
not well behaved in zero (they are not distributions) and must be regularized in order to compute 
the Fourier transform. Since the total Fourier transform is well defined, the result cannot depend 
on the regularization. A convenient regularization of 1/A" for n integer, is to be proportional to 
the n-th derivative of In |A| (" A""" (-1)""^ dx In |A| / (n - 1)!). With this definition the Fourier 
transform, in the sense of distribution, is 

dX e'^y _ ?/"-isign(2/) 
, 2n (iA)" ^ 2(n-l)! ' 
Using this equation and eq. Q we obtain 



^ _ (c? - 1) (flfc - x) sign (ofc - x) 



By construction, P (x) has all the properties of a probability density, moreover, since the 
probability of obtaining a value outside the numerical range of A must be zero, P (x) must 



be supported in [minjafc}, maxjafe}] which, once again, is not readily apparent from eq. (12 1. 
That P (x) is compactly supported follows from Paley- Wiener theorem since it is the Fourier 
transform of an (M) analytic function. In any case a direct proof that P (x) — for x outside 



[minjafc}, maxjafe}] is given in Appendix A and follows directly form eq. (A. 3 1. 

Generically P {x) is a bell-shaped curve supported in [minjafe}, maxjafe}] as can be see from 
fig. [l] The smoothness properties of P (x) can be understood noting that P (x) is a piecewise 

constant function (a part from a bunch of deltas). 

2.1. Cumulative distribution function 

Having computed the probability density, the characteristic, and the moment generating function, 
we would like now to compute the cumulative distribution function (CDF) to complete the picture. 
The CDF can be obtained via the following integral 



cdf {x) = Prob {A<x)=^ ( P (t) , 



V }dt. 

-n 

as long as — < minjafe}. The integration of each term in eq. ( [l2| gives 

(t-ak) sign [t- ak)dt = sign [x - ak) — t-, — tt \ t-, — tt 

n {d-1) [d-l) 
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Figure 1. Plot of the probability density eq. l |12| for various d. In this particular case we set 
Ofc = k/d. 



Clearly the result cannot depend on f2 as long as — f2 < min{afe}. In fact, using again formula (A. 3 1 
of [Appendix A[ the sum of all the terms containing il gives 



1 (—51 — ak)'^ ^ 1 
All in all the final expression for the cumulative is 



cdf i^) = \ + \Y1 ^^S^ 



Ofe) 



k 



{x - ak) 



ak) 



(13) 



(14) 



3. General case 



So far we assumed that the spectrum of A was non-degenerate. Although this is the most common 
situation, one may want to consider the case where A is a projector. We then turn to the fully general 
case where the spectrum of A consists of I distinct eigenvalues {ojlj^j^ with degeneracy Uj satisfying 



d. For a change, let us compute the moment generating function M [y, fi) := e^(-^ 



instead of the characteristic function. The shift constant is chosen to satisfy O > max{aj} so that 
for ?/ > the Gaussian integral is well defined. Gaussian integration gives the same result as for 
the non-degenerate case, which we now re-write as 

(d-1)! dr 



{-iT J-oo 2^ n -^1 (r - r,T^ 
To compute the above integral with the residues we must compute the following derivatives 

1 



Res 



Tk 



ink - 1) 



UUkir-r,r 



(15) 



(16) 



Alternatively, another possibility is to use the Chinese Remainder Theorem (see e.g. [TD])||]to get 
the partial fraction decomposition of the function in eq. (15 1. This way one expresses that function 



I LCV whishes to thank Pierre- Yves Gaillard for pointing out this connection. 
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as a sum of functions with simple poles so that the residues can be evaluated straightforwardly. 



The results coincide and in fact eq. ( 16 1 provides a way to obtain the partial fraction decomposition 



of the function in (151 by differentiation. The procedure is outlined in A.l and the final result for 



the moment generating function is 



Miy^n) = (d-1)!^ Yl 



yd+Mk-nk (nfc - 1 - Mk)\ 



which correctly reduces to eq. (A.l I when all Uk — 1. As for the non-degenerate case, this function 
is regular &t y — 0, and it is fact an entire function. The probability density can be obtained with 
the same arguments used for the non-degenerate case and the result is 



(afe - xf+^^''-^>'-^ sign (afc - x) (-1)*'^ {d - 1)! 



^^^^Q 2 {d + Mk - uk - 1)! (nfe - 1 - Mfc)! 



X 



As a concrete example we will consider in some detail the case where A is a one-dimensional 
projector. 

3.1. Probability density for the random guess 

The case where is a one-dimensional projector is of some relevance and we discuss it with some 
detail. Because of unitary invariance of the measure we can fix the reference state to be |1) in 
some basis, i.e. A — |1)(1|. In this case the quantity A that we are considering is the fidelity 
between a reference state |1) and a random vector i.e. A = |(1|V')|^ (normally written as T). 
It is a standard textbook exercise (see e.g. |11| chapter two) to compute the average random guess. 
Using eq. ([I]) one gets F — l/d, but what is the form of the probability distribution? Since A is a 
one-dimensional projector it has two eigenvalues ai = 1 and 02 = with multiplicity ni = 1 and 



712 = rf — 1 respectively. Inserting these values in eq. (17) we obtain for the moment generating 
function 

{ y{l-n) d-2 n ^ 

M^(y,0) = (d- 1)! <^ — ^ - y — — — . (19) 

Once again, despite its appearance, this function is regular at y = and in fact entire. Rearranging 
the terms in the sum we can write it as 



Setting 51 = we readily obtain the moments as 
riUd-lW 
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which agrees with a result of Von Neumann |12|[T^ . The probability density can be obtained either 
by Fourier transforming eq. (19 1 or by setting ai = 1, ni = 1 and a2 — 0, n2 = d — I in eq. (18 1. 
The sum over M2 becomes a binomial and the result is surprisingly simple 

(x) = (d - 1) (1 - xf-^ I[oa] {x) . (21) 

Here we denoted by I[o,i] (x) the indicator function of the set [0,1], i.e. I[o.i] (x) — 
[sign (x) + sign (1 — x)] /2. Eq. (21 1 is the so-called beta distribution with parameters a = 1 and 
f3 = d — 1. In Bayesian statistics, the beta distribution can be seen as the posterior probability 
of the parameter p of a binomial distribution after observing a — I successes (with probability of 
success given by p) and /3 ~ 1 failures (with probability of failure 1 ~ p). The CDF is given by a 
simple integration and reads 

cdijr{x) = 1- {1- xf~\ forO<a;<l, (22) 

whereas cdfjr {x) — 0, (1) for x < {x > 1) respectively. 



4. Comparison with Levy's lemma 

A typical approach to gain information on the concentration properties of a random variable X, is 
to compute the first few moments of the variable and then use variations of Markov's or Chebyshev's 
inequalities to obtain a bound on Prob (X — X > e). Alternatively in some cases one can use the 
Levy's lemma which typically provides tighter bounds. Since we obtained the probability density 
exactly we would like to compare the exact concentration prediction with that obtained by Levy's 
bound. 

Roughly speaking Levy's lemma states that, for a vector in a large-dimensional hypersphere 
the probability that a (sufficiently smooth) function is far from its average is exponentially small 
in the dimension of the space. There are many versions of Levy's lemma involving either the mean 
or the median or differing for the definition of being "far from". To be specific we use the following 
version [H]: for x G S'^ = {y G M''+^, \\y\\ = l}, and for a function / with Lipschitz constant rj 

Prob {/ (x) - (/) > e} < 2 exp (-Ci (fc + 1) eV^') (23) 

with Ci = (9Tr^ In 2) ^ . In our setting lives in a d dimensional complex space so k — 2d — 1. 

We will compare the prediction of the Levy's lemma to our exact result for two concrete 
examples of operator A. 



4-.1. Random guess 

First we consider again the fidelity of the random guess, in which case A is a one-dimensional 
projector. As we mentioned previously, the average fidelity is = 1/d, so to compute the LHS 
of eq. (23 1 it suffices to notice that Prob {J- > x) — 1 — cdfjr (x) and set x = T + e = l/rf + ein 
eq. (p2L Then 



Froh(T-T> e) = 1 - 



1 + ed 



p-(<i-l)/(rf(l-e)) 
1 

1 - e 



,ln(l-c)d 



(24) 
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where the last equation has been obtained in the Hniit of e small and d large. The Lipschitz constant 
of the function A {(j)) — {4)\A\(f)) has been calculated in [TS] Appendix A (see also where it has 
been shown that \A{cf)i) ~ A{4>2)\ < 2 ||^|| | — |</>2) | where is the operator norm of A 

(maximum singular value). In our case A = ]l)(l| so = 1 and we may take r] = 2 in eq. (23l 

above. So the Levy's lemma predicts 



Prob (J" - -F > e) < e^'^^''* 



C = 



1 

187r3 In 2 



1 

387 



As we see comparing with eq. (24 1 Levy's bound pays a very small pre-factor in the exponential 



compared to the exact behavior and a quadratic rather than a linear dependence on the error e. 



4-. 2. Number operator 

Let us now take an example from the non-degenerate case and take A to be the number operator N, 
i.e. the operator with ak — k for k — 1,2, ... , d. For this operator we have A = tv A/d = {d + 1) /2. 
Since the norm of N grows linearly with d we do not expect concentration to take place in this 
case. Now 



Prob {A-A>e} = l-cdf 



1 



e =:B{d,e) 



We investigated numerically the function B (d, e) using eq. (14 1. Our results indicate that for 

TV = d 



large d and small e, B {d, e) ~ ae where the constant C sa 0.25. The operator norm is 
so Levy's lemma in this case predicts a bound independent of d: 



B{d,e) 
C 



< 2e 



0.0103. 



1 

97' 



97r3 In 2 

From this comparison we can draw similar conclusions as for the case of the random guess. 
Namely Levy's lemma predicts a qualitatively correct behavior in d, but with a much larger constant: 
C w 24C". The quadratic dependence on the error is typically smaller and in both cases seen here, 
it was seen in fact to be linear for small enough e. 



5. Central limit theorem? 



If the (non-degenerate) self-adjoint operator A has the following spectral resolution A = 
C'k\k){k\, our random variable A{'4') can be written as a weighted sum of the random variables 
Xkiip) '■= \{4'\k)\'^. In the limit of large Hilbert space dimension d it is easy to see that the Xk's 
decouple. Indeed if h ^ k one has 



XhX, 



{i,\k){k\^){m{hw 
i + p 



while for h 



tr[|V)(V'|^2|fc)(fc| ^ 
1 

" d(d + iy 

2/d{d + 1) from eq. (20) with n = 2. Whence one obtains for the covariance: 



tr 



did+1) 



\k){k\(g>\h){h\ 



(25) 



cov{Xh, Xk) :— XkXh — Xk Xh — 
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for ft, ^ fc and cov{Xk, Xk) = O (2/(i^), i.e. the off-diagonal terms decay faster than the diagonal 
ones. This fact is a consequence of a well known, general result. In physics a manifestation of 
this phenomenon is the fact that the large- iV limit of O {N) invariant field theories (such as the 
non-linear sigma model) is a free, Gaussian, theory (see e.g. [Ml). Probabilists generally tend to see 
this result in the opposite direction, namely: in the large N limit, the Gaussian measure becomes 
highly concentrated on a hyper-sphere of given radius. Given these considerations it is natural to 
expect that the variable A satisfies a central limit theorem (CLT), at least for a certain class of 
operators A. By CLT we mean here that the rescaled random variable 



Z:= {A-A)/y^^^AA), 

(k„ (.4) rt-th cumulant) tends in distribution to a Gaussian with zero mean and unit variance as 
d — )■ oo. 

Instead of trying to give a complete characterization of the precise conditions on A for the CLT 
to apply we will content here with a few examples (see however [7|). Consider first the class of A 
with ak = k°' (with say a > 0). Using formulae ([ij and Q we obtain, at leading order ki (A) cx d", 
K2 {A) oc d^""^, and K3 (A) cx d^"^^ from which we get K3 (Z) cx 1/Vd, i.e. the third cumulant 
of Z vanishes for large d. The approach to Gaussian for this case can be read off from figure |2] 
A potentially interesting class of operators consists of those A for which the average scales as the 
volume of the space i.e. the log of the Hilbert's space dimension. As a last example we consider 
then Gk — logfc. Correctly in this case one has A cx lege?. Proving that the third cumulant of Z 
goes to zero is now more complicated. However Euler-Maclaurin formula is sufficient to prove the 
following result valid at leading order 

J2t 1 (In(fc) - 1)^ 1 
K2 (A) = / , '^^ = - + smaller terms 

d(d+ 1) d 

K3 {A) = ^-JT+ smaller terms 

^ ' d{d + l){d + 2) d^ 

where x = In A: = In (di). Hence we see that also in this case K3 (Z) cx l/y/d. The approach 

to Gaussian is depicted in figure [2] 



6. Conclusions 



In this paper we considered the quantum expectation value of an operator A with respect to a 
pure state \ip). We computed exactly the probability density of the expectation value when 
is drawn from the space of pure states according to the unique (Haar-induced) unitarily invariant 
measure. Generically, the resulting probability distribution is a compactly supported, piecewise 
polynomial function. We used the exact result to test the tightness of the concentration bounds 
obtained by Levy's lemma for a couple of particular cases, namely for A one-dimensional projector 
and for A — N the number operator. Levy's lemma reproduced a qualitatively correct scaling 
behavior with the dimension of the Hilbert, although with very small pre- factor as compared to the 
exact ones. The quadratic scaling with the error e predicted by Levy's lemma was seen to reduce 
to linear scaling for sufficiently small e in the cases studied. We have also noticed that quantum 
expectation of A can be regarded as a linear combination of random variables that decouple in the 
limit of large Hilbert space dimension. Here we limited ourselves to discuss a couple of examples of 
quantum operators whose expectation value fulfill a central limit type of result in such a limit i.e., 
a properly rescaled expectation becomes normally distributed. 
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Figure 2. Approach to Gaussian and central limit theorem for the rescaled variable Z := 
— .A) I \/ K.1 (v4). Left panel: the operator A has eigenvalues a]^ = k'^ . Right panel: a^. = In (fc). 
This latter case roughly corresponds to physical observables in the sense that the average value 
scales with the volume of the space, i.e. A ^ Xad ^ V . The continuous line is the limiting case 
given by a standard normal distribution with zero mean and unit variance. 



Before concluding it is important to mention that the Haar-induced measure over quantum pure 
states is well-known to be unphysical in different ways. In fact, sampling this measure with local 
quantum gates requires exponentially long random circuits [17!. Also, Haar typical quantum states 
are nearly maximally entangled |18J, while low energy eigenstates of local quantum Hamiltonian 
fulfill area laws |19l [201 |5T] i.e., they have low entanglement. In view of these remarks one may 
question the physical relevance of the results presented in this paper and look for more constrained 
prior measures over the |'0)'s. For example, in view of applications to foundations of statistical 
mechanics of closed systems, it would be interesting to generalize our results to "sections of constant 
energy" where one draws pure states uniformly under some constraint of the form (Vjl/f j?/") = E as 
e.g. in the spirit of [22 . 

A more ambitious goal would be to determine the distribution of expectations restricted to a 
set of "physical" states endowed with some "natural" measure. Instances of those ensembles are 
given in |24l 1251 126j where local random quantum circuits and matrix product states respectively 
have been considered. In these cases the lack of the maximal unitary invariance of the Haar measure 
represents the major obstruction one has to overcome. 
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Appendix A. Some useful identities 

Here we want to prove some identities which provide several important relations for the coefficients in 
. We begin by considering the following function for y, real M {y, ft) := e(-^~^)*' 
which is somehow the moment generating function with a shift. The corresponding Gaussian integral 
is well defined when y {A — fil) < 0. To satisfy this condition we assume Q > maxjaj} and y > 0. 



Eq. m and eq. (12 
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The Gaussian integral converges and gives 



Miy,n) 



{d-iy. f dr 



Tj — iy (aj — f2) 



The integral can be evaluated again with complex integration. The contour integral must be closed 
in the lower half plane, all the poles are on the negative imaginary axis and we get 



Miy,n) = 



{d-iy 



{d-i) 



(A.l) 



The same quantity at leading order in y can be computed by first expanding the integral around 
y — and the integrating. We obtain then 

M{y,n) = l + Oiy) (A.2) 



Equating eqns. ( |A.1[ ) and (A.2) term by term we arrive at 

d 



0<n<d-2 

1 n^d-l 



(A.3) 



These equations have been obtained assuming Q > maxjaj}, but since they are analytic in 57 they 



must be true for all complex Q. In particular, setting i7 = in (A.3 1 we deduce that xW is 



regular at A = 0, and in fact analytic and we obtain eq. (|9|. Considering instead Q > maxjafc} 
(resp. n < min{a/j}) and applying the result to eq. (12 1 we obtain that P (x) — for x > maxjafc} 
(resp. X < minjafe}). 

Appendix B. Degenerate case 



We provide here some additional steps needed to obtain eq. (|T9|. In practice we need to write down 



the differentiation in eq. (16 1. The first step is the following 



M=0 

Then we need the multinomial formula 

M 



a 



M 



M 



rm— mg—O 



As customary in physics we write the multiple sum as 

M M 



mi— mn—O 



M 



Note that when applying this to eq. ( 16 1 we miss the term with j — k so we have i—1 (constrained) 
sums over to,-. The final bit is 



5™^ 



1 
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Putting things together, the derivative reads 

/9(nfc-l) 

rife-] 

E 



M=0 



M 



{uj +mj - 1)! 



Going back we get 



!(n, -1)! 



E 



''-I ^ {uk - 1)! 



J\4=0 



rifc - 1 
Mk 



E ^^^^n 



(n,+m,-l)! (-IP 



Correctly all i factors cancel out and we are left with 



Miy^) = id-l)\J2 E 



k = l M..=0 



/3k (Mk) 



E n 



{rij+mj - 1)! 



} m,\{n,-l)\ {ak-a.jY 



(B.l) 
(B.2) 



which correctly reduces to eq. (A.l) when all Uk = 1. On the other hand we still have 
M {y,Q) = 1 + ^ tr (A — ri) + O (y^j which can be used to show directly that M {y, $7) is regular 
at 2/ = and hence analytic in the whole complex plane. 
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